High-dimensional indexing is an important area of current re-search, especially for range and kNN queries. This work in-troduces clustering for the sake of indexing. The goal is to de-velop new clustering methods designed to optimize the data partitioning for an indexing-specific tree structure instead of finding data distribution-based clusters. We focus on iDis-tance, a state-of-the-art high-dimensional indexing method, and take a basic approach to solving this new problem. By uti-lizing spherical clusters in an unsupervised Expectation Max-imization algorithm dependent upon local density and cluster overlap, we create a partitioning of the space providing bal-anced segmentation for a B+-tree. We also look at the novel idea of reclusterin...
The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a po...
More and more data are produced every day. Some clustering techniques have been developed to automat...
Clustering high-dimensional data has been a major challenge due to the inherent sparsity of the poin...
Indexing high dimensional data has its utility in many real world applications. Especially the infor...
Abstract—In this paper, we introduce the ClusterTree, a new indexing approach to representing cluste...
In this paper, we present an efficient B +-tree based indexing method, called iDistance, for Kneares...
The notorious iodimensionality curseln is a well-known phenomenon for any multi-dimensional indexes ...
Abstract: In order to improve the query efficiency, K-means cluster approach is often used to estim...
Abstract The notorious “dimensionality curse ” is a wellknown phenomenon for any multi-dimensional i...
The notorious "dimensionality curse" is a well-known phenomenon for any multi-dimensional indexes at...
Abstract. Pyramid Technique and iMinMax(θ) are two popular high-dimensional indexing approaches that...
The efficient indexing and searching of complex data is an increasing need in order to face the size...
In data mining domain, high-dimensional and correlated data sets are used frequently. Working with h...
In this article, we present an efficient B + -tree based indexing method, ca...
High-dimensional clustering is a method that is used by some content-based image retrieval systems t...
The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a po...
More and more data are produced every day. Some clustering techniques have been developed to automat...
Clustering high-dimensional data has been a major challenge due to the inherent sparsity of the poin...
Indexing high dimensional data has its utility in many real world applications. Especially the infor...
Abstract—In this paper, we introduce the ClusterTree, a new indexing approach to representing cluste...
In this paper, we present an efficient B +-tree based indexing method, called iDistance, for Kneares...
The notorious iodimensionality curseln is a well-known phenomenon for any multi-dimensional indexes ...
Abstract: In order to improve the query efficiency, K-means cluster approach is often used to estim...
Abstract The notorious “dimensionality curse ” is a wellknown phenomenon for any multi-dimensional i...
The notorious "dimensionality curse" is a well-known phenomenon for any multi-dimensional indexes at...
Abstract. Pyramid Technique and iMinMax(θ) are two popular high-dimensional indexing approaches that...
The efficient indexing and searching of complex data is an increasing need in order to face the size...
In data mining domain, high-dimensional and correlated data sets are used frequently. Working with h...
In this article, we present an efficient B + -tree based indexing method, ca...
High-dimensional clustering is a method that is used by some content-based image retrieval systems t...
The k-means algorithm with cosine similarity, also known as the spherical k-means algorithm, is a po...
More and more data are produced every day. Some clustering techniques have been developed to automat...
Clustering high-dimensional data has been a major challenge due to the inherent sparsity of the poin...